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1 Resolving ambiguity for cross-language retrieval ioo% 

Lisa Ballesteros , W. Bruce Croft 

Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval August 1998 

2 Phrasal translation and query expansion techniques for cross-language ioo% 
- 13 information retrieval 

Lisa Ballesteros , W. Bruce Croft 

ACM SIGIR Forum , Proceedings of the 20th annual international ACM SIGIR 
conference on Research and development in information retrieval July 1997 
Volume 31 Issue SI ^ ^ 

3 Improving query translation for cross-language information retrieval ioo% 
[2 using statistical models 

Jlanfeng Gao , Jian-Yun NIe , Endong Xun , Jian Zhang , Ming Zhou , Changning Huang 
Proceedings of the 24th annual international ACM SIGIR conference on 
Research and development in information retrieval September 2001 
Dictionaries have often been used for query translation in cross-language 
information retrieval (CLIR). However, we are faced with the problem of translation 
ambiguity, I.e. multiple translations are stored in a dictionary for a word. In 
addition, a word-by-word query translation is not precise enough. In this paper, we 
explore several methods to improve the previous dictionary-based query translation. 
First, as many as possible, noun phrases are recognized and translated as a whole 
by usin ... 
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3 CLIR 

Bin Wang , Xueqi Cheng , Shuo Bai 

Proceedings of the 25th annual international ACM SIGIR conference on 
Research and development in information retrieval August 2002 

This paper proposes an example-based phrase translation nnethod in a Chinese to 
English cross-language information retrieval (CLIR) system. The method can 
generate much more accurate query translations than dictionary-based and common 
MT-based methods, and then improves the retrieval performance of our CLIR 
system. 



5 Cross-language information retrieval with the UMLS metathesaurus ioo% 

Cft David Eichmann , Miguel E. Ruiz , Padmini Srinivasan 

— Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval August 1998 



6 The effects of query structure and dictionary setups in dictionary-based 99% 
U cross-language infornnatlon retrieval 

Ah Pirkola 

Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval August 1998 



7 Browsing by phrases: terminological information in interactive 99% 
U multilingual text retrieval 

Anselnno Penas , Julio Gonzalo , Felisa Verdejo 

Proceedings of the first ACM/IEEE-CS joint conference on Digital libraries 

January 2001 

This paper present an interactive search engine (Website Term Browser) which 
mal<es use of phrasal infornnatlon to process queries and suggest relevant topics in a 
fully multilingual setting. 



8 Exploiting a Chinese-English bilingual wordlist for English-Chinese cross 99% 
0^ language infornnation retrieval 

K. L. Kwok 

Proceedings of the fifth international workshop on on Infornnation retrieval with 
Asian languages Novennber 2000 

We investigated using the LDC English/Chinese bilingual wordlists for English- 
Chinese cross language retrieval. It is shown that the Chinese-to-English wordlist 
can be considered as both a phrase and word dictionary, and is preferable to the 
English-to-Chinese version in ternns of phrase translations and word translation 
selection. Additional techniques such as target corpus frequency-based term 
selection and weighting were employed. Experiments show that over 70% of 
monolingual effectiveness ... 



9 Cross-language multinnedia information retrieval 98% 

Sharon Flank 

Proceedings of the sixth conference on Applied natural language processing 

April 2000 

Simple measures can achieve high-accuracy cross-language retrieval in carefully 
chosen applications. Image retrieval is one of those applications, with results 
ranging from 68% of human translator performance for German, to 100% for 
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French. 



10 Improving the effectiveness of infornnation retrieval with local context 98% 
U analysis 

Jinxi Xu , W. Bruce Croft 

ACM Transactions on Information Systems (TOIS) January 2000 
Volume 18 Issue 1 

Techniques for autonnatic query expansion have been extensively studied in 
information research as a means of addressing the word mismatch between queries 
and documents. These techniques can be categorized as either global or local. While 
global techniques rely on analysis of a whole collection to discover word 
relationships, local techniques emphasize analysis of the top-ranked documents 
retrieved for a query. While local techniques have shown to be more effective that 
global techniques in ... 

11 Cross-language spoken document retrieval using HMM-based retrieval 97% 

12 model With multi-scale fusion 

Wai-Kit Lo , Helen Meng , P. C. Ching 

ACM Transactions on Asian Language Information Processing (TALIP) March 



Volume 2 Issue 1 

Cross-language spoken document retrieval (CL-SDR) is the technology that 
facilitates automatic retrieval of relevant Information from a collection of spoken 
documents in a language that is different from that used in the queries. Information 
sources that are in different languages can then be retrieved automatically with CL- 
SDR, and the number of searchable information sources will increase significantly. 
The HMM-based retrieval model Is a probabilistic formulation for the retrieval 
problem. Ext ... 

12 Automatic acquisition of phrasal knowledge for English-Chinese 96% 
12 bilingual information retrieval 

Ming-Jer Lee , Lee-Feng Chien 

Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval August 1998 



13 Distributed Information Retrieval: Exploiting a controlled vocabulary to 9i% 
U improve collection selection and retrieval effectiveness 

James C. French , Allison L. Powell , Fredric Gey , Natalia Perelman 
Proceedings of the tenth international conference on Information and 
knowledge management October 2001 

Vocabulary incompatibilities arise when the terms used to index a document 
collection are largely unknown, or at least not well-known to the users who 
eventually search the collection. No matter how comprehensive or well-structured 
the indexing vocabulary, it Is of little use if it is not used effectively in query 
formulation. This paper demonstrates that techniques for mapping user queries into 
the controlled indexing vocabulary have the potential to radically improve document 
retrieval perform ... 

14 Corpus Linguistics: Effective arabic-english cross-language information 9i% 
12 retrieval via machine-readable dictionaries and machine translation 

Mohammed Aljlayl , Ophir Frieder 
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Proceedings of the tenth international conference on Information and 

knowledge management October 2001 

In Cross-Language Information Retrieval (CLIR), queries in one language retrieve 
relevant documents in other languages Macliine-Readable Dictionary (MRD) and 
Machine Translation (MT) are important resources for query translation in CLIR. We 
investigate MT and MRD to Arabic-English CLIR. The translation ambiguity 
associated with these resources is the key problem. We present three methods of 
query translation using a bilingual dictionary for Arabic-English CLIR. First, we 
present the Every-Match ( ... 



15 Combining multiple sources for short query translation in Chinese- 87% 
12 English cross-language information retrieval 

Altao Chen , Hailing Jiang , Fredric Gey 

Proceedings of the fifth international worl<shop on on Information retrieval with 
Asian languages November 2000 

In this paper, we examine various factors that affect the retrieval performance of 
Chinese-English cross-language retrieval. The factors Include segmentation 
dictionary coverage, segmentation algorithm, transfer dictionary coverage, transfer 
dictionary quality, and translation disambiguation. The paper introduces an idea of 
recovering the original English names for the transliterated Chinese words, mainly 
the proper name, using search engine. We used two transfer dictionaries and a 
Chinese se ... 



16 Cross-language Information Retrieval: Resolving query translation 87% 
12 ambiguity using a decaying co-occurrence model and syntactic 
dependence relations 

Jianfeng Gao , Ming Zhou , Jian-Yun Nie , Hongzhao He , Weijun Chen 
Proceedings of the 25th annual international ACM SIGIR conference on 
Research and development in information retrieval August 2002 

Bilingual dictionaries have been commonly used for query translation in cross- 
language information retrieval (CLIR). However, we are faced with the problem of 
translation selection. Several recent studies suggested the utilization of term co- 
occurrences in this selection. This paper presents two extensions to improve them. 
First, we extend the basic co-occurrence model by adding a decaying factor that 
decreases the mutual information when the distance between the terms increases. 
Second, we inco ... 



17 BI^IR-J2: a test collection for evaluation of Japanese information 85% 
12 retrieval systems 

Tetsuya Sakai , Tsuyoshi Kitani , Yasushi Ogawa , Tetsuya Ishlkawa , Haruo Kimoto , 
Ikuo Keshi , Jun Toyoura , Toshikazu Fukushima , Kunio Matsui , Yoshihiro Ueda , 
Takenobu Tokunaga , Hiroshi Tsuruoka , Hidekazu Nakawatase , Teru Agata , Noriko 
Kando 

ACM SIGIR Forum September 1999 
Volume 33 Issue 1 

BMIR-J2 is the first complete test collection generally available for evaluating 
Japanese information retrieval systems. BMIR-J2 features include a novel division of 
search requests based on various functions required to perform successful retrieval. 
BMIR-J2 and its smaller predecessor BMIR-Jl were constructed by a volunteer- 
based working group under the Information Processing Society of Japan. We hope 
that BMIR-J2 will come into wide use and that it will foster the development of 
Japanese IR sy ... 
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18 Translation of web queries using anchor text mining 85% 

Wen-Hsiang Lu , Lee-Feng Chien , Hsi-Jian Lee 

ACM Transactions on Asian Language Information Processing (TALIP) June 2002 
Volume 1 Issue 2 

This article presents an approach to automatically extracting translations of Web 
query terms through mining of Web anchor texts and link structures. One of the 
existing difficulties in cross-language information retrieval (CLIR) and Web search is 
the lack of appropriate translations of new terminology and proper names. The 
proposed approach successfully exploits the anchor-text resources and reduces the 
existing difficulties of query term translation. Many query terms that cannot be 
obtained in ... 



19 Cross-language Information Retrieval: Comparing cross-language 83% 
U query expansion techniques by degrading translation resources 

Paul McNamee , James Mayfield 

Proceedings of the 25th annual international ACM SIGIR conference on 

Research and development in information retrieval August 2002 

The quality of translation resources is arguably the most important factor affecting 
the performance of a cross-language information retrieval system. While many 
investigations have explored the use of query expansion techniques to combat 
errors induced by translation, no study has yet examined the effectiveness of these 
techniques across resources of varying quality. This paper presents results using 
parallel corpora and bilingual wordlists that have been deliberately degraded prior to 
query tr ... 



20 MT-based Japanese-Enlish cross-language IR experiments using the 82% 
TREC test collections 

Tetsuya Sakai 

Proceedings of the fifth international workshop on on Information retrieval with 
Asian languages November 2000 

This paper evaluates the effectiveness of MT-based Japanese-English CLIR using a 
subcollection of the TREC test collections and two bilingual researchers to separately 
translate the TREC requests into Japanese. Our main findings are as follows: (l)With 
the aid of pseudo-relevance feedback, MT-based J-E CLIR can be as effective as 
"best-case" monolingual IR. In particular, although poor MT quality often leads to 
poor initial CLIR performance, pseudo-relevance feedback is useful for ... 
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21 Information retrieval session 3: cross language retrieval: Addressing the 82% 
12 lack of direct translation resources for cross-language retrieval 

Lisa Ballesteros , Mark Sanderson 

Proceedings of the twelfth international conference on Information and 

knowledge management November 2003 

Most cross language information retrieval research concentrates on language pairs for 
which direct, rich, and often multiple translation resources already exist. However, for 
most language pairs, translation via an intermediate language is necessary. Two 
distinct methods for dealing with the additional ambiguity Introduced by the extra 
translation step have been proposed and Individually, shown to improve retrieval 
effectiveness. Two previous works Indicated that in combination, the methods were ... 



22 Cross-language Information Retrieval: Cross-lingual relevance models 

Victor Lavrenko , Martin Choquette , W. Bruce Croft 

Proceedings of the 25th annual international ACM SIGIR conference on Research 

and development in information retrieval August 2002 

We propose a formal model of Cross-Language Information Retrieval that does not 
rely on either query translation or document translation. Our approach leverages 
recent advances in language modeling to directly estimate an accurate topic model in 
the target language, starting with a query in the source language. The model 
integrates popular techniques of disambiguation and query expansion in a unified 
formal framework. We describe how the topic model can be estimated with either a 
parallel corpus ... 
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23 A first step towards flexible local feedback for ad hoc retrieval 

1^ Tetsuya Sakai , Masahiro Kajiura , Kazuo Sumita 

Proceedings of the fifth international workshop on on Information retrieval with 
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Asian languages November 2000 



Local feedback for ad hoc retrieval typically hurts performance for about one-third of 
the search requests while improving the average performance. Our objective is to 
make it more reliable by estimating the optimal number of assumed-relevant 
documents and the optimal number of expansion terms for each request. We examine 
some simple optimization methods based on: the number of case particles in the 
request; the number of initial search terms; the highest document score in the 
initial ... 



24 The impact on retrieval effectiveness of skewed frequency distributions 82% 

Mark Sanderson , C. J. Van Rijsbergen 
— ACM Transactions on Information Systems (TOIS) October 1999 
Volume 17 Issue 4 

We present an analysis of word senses that provides a fresh insight into the impact of 
word ambiguity on retrieval effectiveness with potential broader implications for other 
processes of information retrieval. Using a methodology of forming artifically 
ambiguous words, known as pseudowords, and through reference to other 
researchers' work, the analysis illustrates that the distribution of the frequency of 
occurrence of the senses of a word plays a strong role in ambiguity's impact of effe ... 



25 Cross-lingual Infornnation retrieval: Automatic transliteration for 80% 
U Japanese-to-English text retrieval 

Yan Qu , Gregory Grefenstette , David A. Evans 

Proceedings of the 26th annual international ACM SIGIR conference on Research 

and development in informaion retrieval July 2003 

For cross language information retrieval (CLIR) based on bilingual translation 
dictionaries, good perfornnance depends upon lexical coverage in the dictionary. This 
is especially true for languages possessing few inter-language cognates, such as 
between Japanese and English. In this paper, we describe a nnethod for automatically 
creating and validating candidate Japanese transliterated terms of English words. A 
phonetic English dictionary and a set of probabilistic mapping rules are used for 
autom ... 



26 Structured translation for cross-language infornnation retrieval 80% 

Ruth Sperer , Douglas W. Oard 
— Proceedings of the 23rd annual international ACM SIGIR conference on Research 
and development in information retrieval July 2000 



The paper introduces a query translation model that reflects the structure of the 
cross-language information retrieval task. The model is based on a structured 
bilingual dictionary in which the translations of each term are clustered into groups 
with distinct meanings. Query translation is modeled as a two-stage process, with the 
system first determining the intended meaning of a query term and then selecting 
translations appropriate to that meaning that might appear in the document 
collectio ... 



27 A month to topic detection and tracking in Hindi 77% 

James Allan , Victor Lavrenko , Margaret E, Connell 
— ACM Transactions on Asian Language Information Processing (TALIP) June 2003 
Volume 2 Issue 2 

We describe the one-month (June 2003) effort to create a topic detection and 
tracking (TDT) system to support news stories in Hindi. The University of 
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Massachusetts submitted results for three different TDT tasks in the DARPA surprise 
language evaluation. The official task was topic tracking, but we also provided results 
for the new event detection and topic detection (clustering) tasks. Our approach to all 
three tasks was based on the vector-space model of information retrieval. We also 
describ ... 



28 Word sense disambiguation for cross-language information retrieval 77% 

Mary Xiaoyong Liu , Ted Diamond , Anne R. Diekema 

Proceedings of the workshop on Student research workshop April 2000 

We have developed a word sense disambiguation algorithm, following Cheng and 
Wilensky (1997), to disambiguate among WordNet synsets. This algorithm is to be 
used in a cross-language Information retrieval system, CINDOR, which indexes 
queries and documents in a language-neutral concept representation based on 
WordNet synsets. Our goal is to improve retrieval precision through word sense 
disambiguation. An evaluation against human disambiguation judgements suggests 
promise for our approach. 



29 Posters: Assessing the effectiveness of pen-based input queries 77% 

Stephen Levin , Paul Clough , Mark Sanderson 

Proceedings of the 26th annual international ACM SIGIR conference on Research 

and development in informaion retrieval July 2003 

In this poster, we describe an experiment exploring the effectiveness of a pen based 
text input device for use in query construction. Standard TREC queries were written, 
recognised, and subsequently retrieved upon. Comparisons between retrieval 
effectiveness based on the recognised writing and a typed text baseline were made. 
On average, effectiveness was 75% of the baseline. Other statistics on the quality 
and nature of recognition are also reported. . 



30 Evaluating a probabilistic model for cross-lingual information retrieval 77% 

Jinxi Xu , Ralph Weischedel , Chanh Nguyen 

Proceedings of the 24th annual international ACM SIGIR conference on Research 

and development in information retrieval September 2001 

This work proposes and evaluates a probabilistic cross-lingual retrieval systenn. The 
system uses a generative model to estimate the probability that a document in one 
language is relevant, given a query in another language. An important component of 
the model is translation probabilities from terms in documents to terms in a query. 
Our approach is evaluated when 1) the only resource is a manually generated 
bilingual word list, 2) the only resource is a parallel corpus, and 3) both resour ... 

31 Improving cross language retrieval with triangulated translation 77% 

Tim Collins , Mark Sanderson 

Proceedings of the 24th annual international ACI^ SIGIR conference on Research 

and development in information retrieval September 2001 

Most approaches to cross language information retrieval assume that resources 
providing a direct translation between the query and document languages exist. This 
paper presents research examining the situation where such an assumption is false. 
Here, an intermediate (or pivot) language provides a means of transitive translation 
of the query language to that of the document via the pivot, at the cost, however, of 
Introducing much error. The paper reports the novel approach of translating ... 



32 Workshop on patent retrieval SIGIR 2000 workshop report 



77% 



Noriko Kando , Mun-Kew Leong 
ACM SIGIR Forum April 2000 
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1 Corpus Linguistics: Effective arabic-english cross-language information ioo% 
[2 retrieval via machine-readable dictionaries and machine translation 

Mohannnned Aljlayl , Ophir Frieder 

Proceedings of the tenth international conference on Information and 

knowledge management October 2001 

In Cross-Language Information Retrieval (CLIR), queries in one language retrieve 
relevant documents in other languages Machine-Readable Dictionary (MRD) and 
Machine Translation (MT) are important resources for query translation in CLIR. We 
investigate MT and MRD to Arabic-English CLIR. The translation ambiguity 
associated with these resources is the key problem. We present three methods of 
query translation using a bilingual dictionary for Arabic-English CLIR. First, we 
present the Every-Match ( ... 



2 Distributed Information Retrieval: Exploiting a controlled vocabulary to ioo% 
13 improve collection selection and retrieval effectiveness 

James C. French , Allison L. Powell , Fredric Gey , Natalia Perelman 
Proceedings of the tenth international conference on Information and 
knowledge management October 2001 

Vocabulary incompatibilities arise when the terms used to index a document 
collection are largely unknown, or at least not well-known to the users who 
eventually search the collection. No matter how comprehensive or well-structured 
the indexing vocabulary, it is of little use if it is not used effectively in query 
formulation. This paper demonstrates that techniques for mapping user queries into 
the controlled indexing vocabulary have the potential to radically improve document 
retrieval perform ... 



3 Automatic construction of parallel English-Chinese corpus for cross- ioo% 
language information retrieval 

Jiang Chen , Jian-Yun Nie 
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Proceedings of the sixth conference on Applied natural language processing 

April 2000 

A major obstacle to the construction of a probabilistic translation model is the lack of 
large parallel corpora. In this paper we first describe a parallel text mining system 
that finds parallel texts automatically on the Web. The generated Chinese-English 
parallel corpus is used to train a probabilistic translation model which translates 
queries for Chinese-English cross-language information retrieval (CLIR). We will 
discuss some problems in translation model training and show the preliminary C ... 



4 Cross-language multimedia information retrieval ioo% 

Sharon Flank 

— Proceedings of the sixth conference on Applied natural language processing 

April 2000 

Simple measures can achieve high-accuracy cross-language retrieval in carefully 
chosen applications. Image retrieval is one of those applications, with results 
ranging from 68% of human translator performance for German, to 100% for 
French. 



5 A month to topic detection and tracking in Hindi ioo% 

James Allan , Victor Lavrenko , Margaret E. Connell 

ACM Transactions on Asian Language Information Processing (TALIP) June 2003 
Volume 2 Issue 2 

We describe the one-month (June 2003) effort to create a topic detection and 
tracking (TDT) system to support news stories in Hindi. The University of 
Massachusetts submitted results for three different TDT tasks in the DARPA surprise 
language evaluation. The official task was topic tracking, but we also provided 
results for the new event detection and topic detection (clustering) tasks. Our 
approach to all three tasks was based on the vector-space model of information 
retrieval. We also describ ... 



6 Information retrieval session 3: cross language retrieval: A novel ioo% 
method for stemmer generation based on hidden markov models 

Massimo Melucci , Nicola Orio 

Proceedings of the twelfth international conference on Information and 

knowledge management November 2003 

In this paper, we present a method based on Hidden Markov Models (HMMs) to 
generate statistical stemmers. Using a list of words as training set, the method 
estimates the HMM parameters which are used to calculate the most probable stem 
for an arbitrary word. Stemming Is performed by computing the most probable path, 
through the HMM states, corresponding to the input word. Linguistic knowledge or a 
training set of manually stemmed words are not required. We describe the method 
and the results of ... 



7 Cross-lingual information retrieval: Automatic transliteration for ioo% 
13 Japanese-to-English text retrieval 

Yan Qu , Gregory Grefenstette , David A. Evans 

Proceedings of the 26th annual international ACM SIGIR conference on 

Research and development in informaion retrieval July 2003 

For cross language Information retrieval (CLIR) based on bilingual translation 
dictionaries, good performance depends upon lexical coverage in the dictionary. This 
is especially true for languages possessing few inter-language cognates, such as 
between Japanese and English. In this paper, we describe a method for 
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automatically creating and validating candidate Japanese transliterated ternns of 
Englisli words. A phonetic English dictionary and a set of probabilistic mapping rules 
are used for autom ... 

8 Translation of web queries using anchor text mining ioo% 

Wen-Hsiang Lu , Lee-Feng Chien , Hsi-Jian Lee 
— ACM Transactions on Asian Language Information Processing (TALIP) June 2002 

Volume 1 Issue 2 

This article presents an approach to automatically extracting translations of Web 
query terms through mining of Web anchor texts and link structures. One of the 
existing difficulties in cross-language information retrieval (CLIR) and Web search is 
the lack of appropriate translations of new terminology and proper names. The 
proposed approach successfully exploits the anchor-text resources and reduces the 
existing difficulties of query term translation. Many query terms that cannot be 
obtained in ... 

9 Browsing by plirases: terminological information in interactive ioo% 
12 multilingual text retrieval 

Anselmo Penas , Julio Gonzalo , Felisa Verdejo 

Proceedings of the first ACM/IEEE-CS joint conference on Digital libraries 

January 2001 

This paper present an interactive search engine (Website Term Browser) which 
makes use of phrasal information to process queries and suggest relevant topics in a 
fully multilingual setting. 

10 Improving cross language retrieval witli triangulated translation ioo% 

Tim Collins , Mark Sanderson 

Proceedings of the 24th annual international ACM SIGIR conference on 
Research and development in information retrieval September 2001 

Most approaches to cross language information retrieval assume that resources 
providing a direct translation between the query and document languages exist. This 
paper presents research examining the situation where such an assumption is false. 
Here, an intermediate (or pivot) language provides a means of transitive translation 
of the query language to that of the document via the pivot, at the cost, however, of 
introducing much error. The paper reports the novel approach of translating ... 

11 Combining multiple sources for short query translation in Chinese- ioo% 

12 English cross-language information retrieval 

Aitao Chen , Hailing Jiang , Fredric Gey 

Proceedings of the fifth international workshop on on Information retrieval with 
Asian languages November 2000 

In this paper, we examine various factors that affect the retrieval performance of 
Chinese-English cross-language retrieval. The factors include segmentation 
dictionary coverage, segmentation algorithm, transfer dictionary coverage, transfer 
dictionary quality, and translation disambiguation. The paper introduces an idea of 
recovering the original English names for the transliterated Chinese words, mainly 
the proper name, using search engine. We used two transfer dictionaries and a 
Chinese se ... 

12 Information access across the language barrier (demonstration ioo% 
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12 abstract): the MuST system 

Chin -Yew Lin 

Proceedings of the 22nd annual international ACM SIGIR conference on 
Research and development in information retrieval August 1999 



13 Cross-language information retrieval based on parallel texts and ioo% 
13 automatic mining of parallel texts from the Web 

Jian-Yun Nie , Michel Sinriard , Pierre Isabelle , Richard Durand 

Proceedings of the 22nd annual international ACM SIGIR conference on 

Research and development in information retrieval August 1999 
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